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AMENDMENTS TO THE CLAIMS 

Please add or amend the claims to read as follows, and cancel without prejudice or 
disclaimer to resubmission in a divisional or continuation application claims indicated as 
cancelled: 

1, (Canceled) 

2. (Currently amended) An automatic method of classifying molecules , wherein the 

molecules are nucleic acids or amino acids, having similar biological function 
sequence comprising the steps of: 

a) automatically creating a hierarchical organization of the molecules in a 
database comprising the steps of: 

i calculating pairwise similarities between said molecules in said 
database by combining at least one standard measure of similarity, resulting in a 
first set of expectation values of similarity; 

ii analyzing said first set of expectation values of similarity so as to 
... .. obtain a second set of expectation values of similarity, wherein said molecules of 

the second set have a high degree of similarity; 

iii merging said resulting molecules of step ii) so as to form clusters, 
wherein only molecules with an expectation value below a first restricted 
threshold are merged; 

iv identifying groups of clusters that are neighbors in the protein space 
graph using local consideration resulting in related clusters; 

v determining the relationship between related dusters clusters that are 
neighbors in the protein space graph in said groups; 

vi analyzing said groups of the related clusters of step v), based on the 
threshold thereby creating a hierarchical organization of said molecules; 
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b) automatically determining the position of a selected molecule based 
on the hierarchical organization of step (a) of a oeloctod molecule in said 
database bv a method based on the hierarchical organization of step a , comprising 
the steps of: 

i identifying pairwise similarities between said selected molecule to the 
molecules in said database by combining at least one standard measure of 
similarity, resulting in a third set of expectation values of similarity; 

ii identifying geometric averaging of said selected molecule to each of 
said resulting clusters in said hierarchy, resulting in a forth fourt h set of 
expectation values of similarity; 

iii identifying related clusters from said hierarchy of step a, having a 
geometric averaging with said selected molecule below a second threshold [[,]] ; 
thereby determining the position of a selected molecule in the position of the 
hierarchical organization of step a classifying molecules having similar biologic 
function. 

3. (Currently amended) The method of claim 2, wherein each related cluster resulting from 
step iv is further analyzed, wherein if said geometric averaging of said related cluster to 
a second related cluster is below said first threshold a connection is established 
between two clusters, said connection is merging the two clusters . 

4. (Currently amended) The method of ciaim claim 3, .wherein said related cluster 
resulting from step iv is analyzed by calculating the geometric averaging between 
molecules in one cluster to molecules in the other cluster . 

5. (Currently amended) The method of claim 2, wherein the method of 
determining the relationship between said related clusters in step v further 
comprises the steps of applying a global test on said connected related clusters for 
identifying nuclei of strong relationships within said groups of clusters, 
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wherein each cluster is checked against its nearest cluster by using said 
geometric averaging, wherein if said geometric averaging is below said first 
threshold then the two clusters are merged. 

6, (Currently amended) The method of claim 2, further comprises repeating steps iv, 
v while each time raising the threshold of steps iv and v in a step wise manner 
until a third threshold is achieved, 

7. (Currently amended) The method according to claim 2, wherein said groups 
containing the clusters of every two molecules from different clusters w&h that 
have expectation value below the threshold. 

8 (Currently amended) The method according to claim 2, wherein the method 
further comprises repeating steps iv and v and applying three standard measures of 
similarity. 

9. (Original) The method according to claim 8, wherein said standard measures of 
similarity are Smith Waterman (SW), FASTA, and BLAST.. 

10. (Currently amended) The method according to claim 9, further comprising the 
step of applying a numerical normalization to the expectation value of the SW, the 
FASTA and the BLAST. 

11. (Original) The method according to claim 10, wherein the method step a) ii) is 
analyzed using thresholds 0.1, 0.1 and 10" 3 for SW, FASTA and BLAST 
respectively wherein a value of pairwise similarities between two protein sequences 
is maintained if either SW or FASTA yield an expectation value < 0.1 or BLASTS 
expectation value is < 1 0~ 3 

12. (Currently amended) The method according to claim 11, further comprising 
filtering the results of said pairwise similarities of BLAST thereby excluding low 
complexity segments, using the SEG program; or setting a more stringent threshold 
for BLAST at 10* if filtering decreased the number of related sequences by half 
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13. (Currently amended) The method according to claim 12, further comprising using 
said SEG program and maintaining a value of an expectation value of similarity 
between two protein when all three methods yield an expectation value < 1 , 

14. (Currently amended) The method of claim [[1]] 2, wherein the molecules are nucleic 
acids, 



15. (Currently amended) The method of claim [[I]] 2, wherein the nucleic acids are 
DNAorRNA, 



16. (Currently amended) The method of claim [[1]] 2, wherein the molecules are 
pofvnuclcotidos polynucleotide . 



17, (Previously presented) The method of claim 2, wherein the molecules are proteins. 



18. (Previously presented) The method of claim 2, wherein the molecules are peptides. 



19. (Cancelled) 



20. (Cancelled) 



21. (Currently amended) The method of claim 2, wherein the molecules encode or comprise 
immunoglobulins. 



APPLICANT(S): LINIAL, Michal et ah 

SERIAL NO.: 09/601,278 

FILED : February 22, 200 1 
Page 7 



22. (Cancelled) 

23. (Previously Presented) The method of claim 2, wherein said database is the 
Swissprot database. 

24. (Canceled) 

25. (Canceled) 

26. (Canceled) 

27. (Currently amended) A system for identifying biological families of molecules 

having similar biologic function sequence , said system comprising : 

(I) a storage device containing a first database identifying 
molecule sequences and their corresponding hierarchy; 

(II) a processor connected to said storage device and in 
communication with a second database containing molecule 
sequences; 

(III) an input device connected to said processor for inputting a 
sequence of a selected molecule to be analyzed, wherein 
said processor is programmed for automatically 
classifying molecules having similar hierarchical position, 
biologic function , said classifying comprising the steps of: 

a) automatically creating a hierarchical organization of the molecules in a 
database comprising the steps of: 

i calculating pairwise similarities between said molecules in said 
database by combining at least one standard measure of similarity, resulting in a 
first set of expectation values of similarity; 
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ii analyzing said first set of expectation values of similarity so as to 
obtain a second set of expectation values of similarity, wherein said molecules of 
the second set have a high degree of similarity; 

iii merging said resulting molecules of step ii) so as to form clusters, 
wherein only molecules with an expectation value below a first restricted 
threshold are merged; 

iv identifying groups of clusters that are neighbors in the protein space 
graph using local consideration resulting in related clusters; 

v determining the relationship between related du s ters clusters that are 
neighbors in the protein space graph in said groups; 

vi analyzing said groups of the related clusters of step v) 3 based on the 
threshold thereby creating a hierarchical organization of said molecules; 

b) automatically determining the position of a selected molecule based 
on the hierarchical organization of step fa ) of a selected molecule in said 
database by a method based on the hierarchical organization of atop a , comprising 
the steps of: 

i identifying pairwise similarities between said selected molecule to the 
molecules in said database by combining at least one standard measure of 
similarity, resulting in a third set of expectation values of similarity; 

ii identifying geometric averaging of said selected molecule to each of 
said resulting clusters in said hierarchy, resulting in a ierth fourt h set of 
expectation values of similarity; 

iii identifying related clusters from said hierarchy of step a, having a 
geometric averaging with said selected molecule below a second threshold [[J] : 
thereby determining the position of a selected molecule in the position of the 
hierarchical organization of step a classifying molecules having similar biologic 
function. 
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28, (Cancelled) 



29. (New) The method of claim 2, further comprises the step of classifying the molecules 
as having similar function by attributing said function on the position in the 
hierarchical organization. 



